- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources5
- Resource Type
-
0005000000000000
- More
- Availability
-
50
- Author / Contributor
- Filter by Author / Creator
-
-
Zeng, Yuchen (5)
-
Lee, Kangwook (3)
-
Guldogan, Ozgur (2)
-
Pedarsani, Ramtin (2)
-
Sohn, Jy-yong (2)
-
Chang, Yi-Wei (1)
-
Chen, Yicong (1)
-
Howe, Gregory (1)
-
Kang, Wonjun (1)
-
Koo, Hyung Il (1)
-
Wang, Miaoyan (1)
-
Xu, Min (1)
-
Yi, Kai (1)
-
Zeng, Xiangrui (1)
-
Zhang, Jing (1)
-
#Tyler Phillips, Kenneth E. (0)
-
#Willis, Ciara (0)
-
& Abreu-Ramos, E. D. (0)
-
& Abramson, C. I. (0)
-
& Abreu-Ramos, E. D. (0)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
The evolution from Large Language Models (LLMs) to Multimodal Large Language Models (MLLMs) has spurred research into extending In-Context Learning (ICL) to its multimodal counterpart. Existing such studies have primarily concentrated on image-to-text ICL. However, the Text-to-Image ICL (T2I-ICL), with its unique characteristics and potential applications, remains underexplored. To address this gap, we formally define the task of T2I-ICL and present CoBSAT, the first T2I-ICL benchmark dataset, encompassing ten tasks. Utilizing our dataset to benchmark six state-of-the-art MLLMs, we uncover considerable difficulties MLLMs encounter in solving T2I-ICL. We identify the primary challenges as the inherent complexity of multimodality and image generation, and show that strategies such as fine-tuning and Chain-of-Thought prompting help to mitigate these difficulties, leading to notable improvements in performance. Our code and dataset are available at https://github.com/UW-Madison-Lee-Lab/CoBSAT.more » « less
-
Guldogan, Ozgur; Zeng, Yuchen; Sohn, Jy-yong; Pedarsani, Ramtin; Lee, Kangwook (, International Conference on Learning Representations (ICLR))
-
Guldogan, Ozgur; Zeng, Yuchen; Sohn, Jy-yong; Pedarsani, Ramtin; Lee, Kangwook (, International Conference on Learning Representations (ICLR) 2023)
-
Zeng, Yuchen; Howe, Gregory; Yi, Kai; Zeng, Xiangrui; Zhang, Jing; Chang, Yi-Wei; Xu, Min (, 2021 IEEE International Conference on Image Processing (ICIP))
-
Wang, Miaoyan; Zeng, Yuchen (, Advances in Neural Information Processing Systems 32 (NeurIPS))We consider the problem of identifying multiway block structure from a large noisy tensor. Such problems arise frequently in applications such as genomics, recommendation system, topic modeling, and sensor network localization. We propose a tensor block model, develop a unified least-square estimation, and obtain the theoretical accuracy guarantees for multiway clustering. The statistical convergence of the estimator is established, and we show that the associated clustering procedure achieves partition consistency. A sparse regularization is further developed for identifying important blocks with elevated means. The proposal handles a broad range of data types, including binary, continuous, and hybrid observations. Through simulation and application to two real datasets, we demonstrate the outperformance of our approach over previous methods.more » « less
An official website of the United States government

Full Text Available